Supporting a Social Media Observatory with Customizable Index Structures: Architecture and Performance

نویسندگان

  • Xiaoming Gao
  • Evan Roth
  • Karissa Rae McKelvey
  • Clayton A. Davis
  • Andrew J. Younge
  • Emilio Ferrara
  • Filippo Menczer
  • Judy Qiu
چکیده

The intensive research activity in analysis of social media and microblogging data in recent years suggests the necessity and great potential of platforms that can efficiently store, query, analyze, and visualize social media data. To support these “social media observatories” effectively, a storage platform must satisfy special requirements for loading and storage of multi-terabyte datasets, as well as efficient evaluation of queries involving analysis of the text of millions of social updates. Traditional inverted indexing techniques do not meet such requirements. As a solution, we propose a general indexing framework, IndexedHBase, to build specially customized index structures for facilitating efficient queries on an HBase distributed data storage system. IndexedHBase is used to support a social media observatory that collects and analyzes data obtained through the Twitter streaming API. We develop a parallel query evaluation strategy that can explore the customized index structures efficiently, and test it on a set of typical social media data queries. We evaluate the performance of IndexedHBase on FutureGrid and compare it with Riak, a widely adopted commercial NoSQL database system. The results show that IndexedHBase provides a data loading speed that is six times faster than Riak and is significantly more efficient in evaluating queries involving large result sets.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Maraghe Observatory and an Effort towards Retrieval of Architectural Design of Astronomical Units

Maraghe observatory was built by such engineers as Moayiededdin Orozi etc. under supervision of Khaje Nasireddin Tousi in 7th century AH. The most significant feature associated with Maraghe observatory is the fact that architecture is employed to achieve astronomical purposes in this site. The reason for preferring observatory by astronomers was the fact that these units are superior to wooden...

متن کامل

Creativity, Design Studio Performance, and Social Media: A Study of Instagram Use among Architecture Students

The importance of using visual social media as the digital learning and inspiration resources in architecture is blatantly obvious. On the contrary, there are still gaps in the position of those platforms in the elements of creativity and performance within design studios. The major research question is how does the architecture students' use of architectural content on Instagram relate to thei...

متن کامل

Green envelopes classification: the comparative analysis of efficient factors on the thermal and energy performance of green envelopes

This paper classifies green envelopes as green roofs and green walls according to effective factors, which were derived from literature to compare the green envelopes’ thermal and energy performance in a more effective way. For this purpose, an extensive literature review was carried out by searching keywords in databases and studying related journal papers and articles. The research meth...

متن کامل

Multidimensional Analysis of Distributed Xml Data

The expeditious proliferation of the internet to ubiquity, the infrangible dependence of global enterprises on Web services, the universal adoption of SOA, cloud computing, social media and online publishing has made XML the lingua franca of the digital age and has generated a plethora of data in XML. The immense popularity of NoSQL and document-oriented data stores have also added tremendously...

متن کامل

An integrated Assessment System of Citizen Reaction towards Local Government Social Media Accounts

Agovernmentshouldusesocialmediaforcommunicatingwithitscitizen.Theengagement index score is one of the methods for assessing the rate of governmental success in using social media as a tool in establishing interactive relationships with its citizen. In general, the engagement index score is obtained by calculating the number of posts, number of likes and comments, and so forth on a single social...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014